arXiv: 1501.03419v3 [math.DS] 5 Oct 2015 


JOINT SPECTRAL RADIUS, STURMIAN MEASURES, AND THE 

FINITENESS CONJECTURE 

O. JENKINSON & M. POLLICOTT 


Abstract. The joint spectral radius of a pair of 2 x 2 real matrices (Ao,Ai) £ 
M 2 (K) 2 is defined to be r(A 0 ,Ai) = limsup^^ maxlH^ : ij £ {0,1}}, 

the optimal growth rate of the norm of products of these matrices. 

The Lagarias-Wang finiteness conjecture m, asserting that r(A 0 Mi) is always 
the nth root of the spectral radius of some length-n product ■ ■ ■ Ai n , has been 
refuted by Bousch & Mairesse [5], with subsequent counterexamples presented by 
Blondel, Theys & Vladimirov [2], Kozyakin [13], Hare, Morris, Sidorov & Theys 0. 

In this article we introduce a new approach to generating finiteness counterexam¬ 
ples, and use this to exhibit an open subset of M 2 (R) 2 with the property that each 
member (AoMi) of the subset generates uncountably many counterexamples of the 
form (Ao,Mi). Our methods employ ergodic theory, in particular the analysis of 
Sturmian invariant measures; this approach allows a short proof that the relation 
between the parameter t and the Sturmian parameter V(t) is a devil’s staircase. 


1. Introduction 

1.1. Problem and setting. For a square matrix A with real entries, its spectral radius 
r(A), defined as the maximum modulus of its eigenvalues, satisfies Gelfand’s formula 

r(A ) = lint ||A"|| 1/n , 

n—»• oo 

where || • || is a matrix norm. More generally, for a finite collection A = {A 0 ,..., A;} 
of real square matrices, all of the same size, the joint spectral radius r(A) is defined by 

r(A) = limsupmaxdlAjj • • ■ A in \\ 1/n : ij G {0,...,/}} , (1) 

n—> oo 

or equivalently (see e.g. m by 

r{A) = lim ma x{r(A h ■ ■ ■ A in ) 1/n : ij G {0,..., /}} . (2) 

n—H-oo 

The notion of joint spectral radius was introduced by Rota & Strang [2D], and notably 
popularised by Daubechies & Lagarias p] in their work on wavelets. Since the 1990s it 
has become an area of very active research interest, from both a pure and an applied 
perspective (see e.g. nunuaro. 

The set A is said to have the finiteness property if r(A) = r(A il ■ ■ ■ Aj n ) 1/,n for some 
i lr ..,! n G{0,...,l}. It was conjectured by Lagarias & Wang [15] (see also Gurvits [8]) 
that every such A enjoys the finiteness property. This so-called finiteness conjecture 
was, however, refuted by Bousch & Mairesse [5], and a number of authors (see MOSl 
m have subsequently given examples of sets A for which the finiteness property fails. 
A common feature of these finiteness counterexamples has been a judicious choice of 
a pair of 2 x 2 matrices Ao,Ai, followed by an argument that for certain t > 0, the 
finiteness property fails for the set Aft) = {Aq\ = {A 0 ,tAi}. 
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In fact for many of these examples it has been observed that the family (A(t))t>o 
can be associated with the class of Sturmian sequences of Morse & Hedlund TSi : 
for a given t > 0 an appropriate Sturmian sequence £ {0,1} N turns out to 

give the optimal matrix product, in the sense that the joint spectral radius r(A(t)) 
equals lim, woo r(A-*- ) ■ ■ ■ AfJ ) x / n (see [21 El :9] EE3> HZ] for further details). A Sturmian 
sequence (i n )™ =1 has a well-defined 1-frequency V = linpv-xx> jj J2n=i hi, and it is those 
sets A(t) whose associated Sturmian sequence^] have irrational 1-frequency which yield 
counterexamples to the finiteness conjecture. For certain such families (A(f))i>o (which 
henceforth we refer to as Sturmian families ), it has been proved by Morris & Sidorov 
m (see also [5J p. 109]) that if V(t) denotes the 1-frequency associated to A(t), then 
the parameter mapping t K > V(t) is continuous and monotone, but singular in the 
sense that {t > 0 : V(t) ^ Q} is nowhere dense; in other words, the uncountably many 
parameters t for which finiteness counterexamples occur only constitute a thin subseto 
of M + . 

Examples of Sturmian families (A(f))t>o have been given by Bousch & Mairesse [5], 
who considered the family generated by matrix pairs of the form 

A=^ Kh ° e + 1 2 > (J ,«>0,/i 0 ,/ii>0,/io + /ii<2, (3) 

by Kozyakin [13] . who studied the family generated by pairs of the form 

A = ((c d) ’ (o l) ) ’ ° < a ' d < 1 - h ° ’ ^ 

and by various authors focusing on the family generated by the particular 

pair 

A ={(\ ?)’ (1 !))' (5) 

For an invertible matrix P, the simultaneous similarity (Ao, Ai) i —> (P _1 AoP, P~ 1 A\P ) 
leaves invariant the joint spectral radius, and does not change the sequences (in,)^ 
attaining the optimal matrix product, while if u,v > 0 then (A 0 ,Ai) t-)- {uAq^vAi) 
again leaves unchanged the optimizing sequences. Therefore, declaring A = (A 0 ,Ai) 
and A' = (Aq, Aj) to be equivalent if A' 0 = uP~ 1 A 0 P and Aj = vP~ 1 AiP for some in¬ 
vertible P and u, v > 0, we see that the equivalence of A and Al implies that (A(t))t>o 
is a Sturmian family if and only if (A'(f))t>o is. In particular, (A'(f))t>o is a Sturmian 
family whenever A 1 is equivalent to a matrix pair A of the form (J3]), (|4]), or ([5]). 

The purpose of this article is to introduce an approach to studying the joint spectral 
radius and generating hniteness counterexamples, which in particular yields new ex¬ 
amples of Sturmian families (A(f))t>o, he. where A is not equivalent to a matrix pair 
of the form (|3]), (]4]) , or ([5]). Our method is conceptually different to previous authors, 
employing notions from dynamical systems, ergodic theory, and in particular ergodic 
optimization (see e.g. [TO]). Specifically, we identify a dynamical system T\ with the 
matrix pair A = (Ao, Ai), and cast the problem of determining the joint spectral radius 
r(A) in terms of ergodic optimization (see Theorem [2] below): it suffices to determine 

1 We follow the definition of Sturmian sequence given in [Bj, though note that some authors refer 
to these as balanced sequences, reserving the nomenclature Sturmian precisely for those balanced 
sequences with irrational 1-frequency. 

2 The belief that finiteness counterexamples are rare appears to be widespread; for example Maesumi 
[16] conjectures that they consitute a set of (Lebesgue) measure zero in the space of matrices. 
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the X^-invariant probability measure which maximizes the integral of a certain auxil¬ 
iary real-valued function /_ 4 . Working with the family of A-Sturmian measures (certain 
probability measures invariant under T 4 ) instead of Sturmian sequences, we exploit a 
characterisation of these measures in terms of the smallness of their support to show 
that they give precisely the family of fA(t )-maximizing measures, t > 0. In particular, 
whenever the /Up)-maximizing measure is Sturmian of irrational parameter V{t) then 
A(t) is a finiteness counterexample. 

The A-Sturmian measures are naturally identified with Sturmian measures on hi = 
{0,1} N , the full shift on two symbols (see Definition [4]). A notable feature of our 
approach is that the singularity of the parameter mapping 1 1 —> V(t) (and in particular 
the fact that {t > 0 : V(t) ^ Q} is nowhere dense in M + ) is then readily deduced (see 
Theorem [U] in THO) as a consequence of classical facts about parameter dependence of 
Sturmian measures on (i.e. rather than requiring the ab initio approach of m- 


1.2. Statement of results. We use M 2 (R) to denote the set of real 2x2 matrices, 
and focus attention on certain of its open subsets: 


Notation 1. M 2 (M + ) will denote the set of positive matrices , i.e. matrices in M 2 (M) 
with entries in M + = {x 6 M : x > 0}, and M£ (M + ) = {A G M 2 (M + ) : det A > 0} will 
denote the set of positive orientation-preserving matrices. 


Turning to pairs of matrices, we shall consider the following open subset of (M + ) 2 : 


Definition 1. Let £ C Mj l ”(R + ) 2 denote the set of matrix pairs 


<*•*>=(& *)’(“: *)) ^ + (« + ) 2 

satisfying 

do b\ 

Co d\ 

and 


( 6 ) 


di + C\ — b\ — d\ < 0 < do T 0 ) — ^0 — ^0 • (7) 

For reasons which will become apparent later (see Proposition HJ), C will be referred to 
as the set of concave-convex matrix pairs. 


Finally, our counterexamples to the Lagarias-Wang finiteness conjecture will be 
drawn from a certain open subset 2) (given by Definition [3] below) of € which is con¬ 
veniently described in terms of quantities qa and cr a defined as follows: 


Definition 2. For A 



G M+(K+), define 


and 


Qa 


2b 

a — d — 2b + \/(a — d) 2 + Abe ’ 


a a 


b — a 

a + c — b — d 


It turns out (see Corollary that if (A 0 , A x ) e <£ then ga 0 < 0 < Qa 0 and Qa x < — 1. 
The set D is defined by imposing two inequalities: 
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Definition 3. Define 

£ = { (A 0 , Ai) e£: QA ! < cr Ao and a Al < Qa 0 } ■ 

Clearly £ is an open subset of £, hence also of M 2 (M) 2 . It is also non-empty: for 
example it is readily verified that the two-parameter family 

® != {((c l)’(& j)):fc<l<c, (6,c)e(R + ) 2 } (8) 

is a subset of £. Note that the pair fl5J) studied in |2:, [9, IT7| , and corresponding to 
( 6 , c) = ( 0 , 1 ) in ([HD, lies on the boundary of both £ and £> 2 . 

A version of our main result is the following: 

Theorem 1. The open subset £ C M 2 (M ) 2 is such that if A = (A 0 , Ai) G £ then for 
uncountably many t G M + , the matrix pair (A 0 ,iAi) is a finiteness counterexample. 

If we define (8 C M 2 (M ) 2 to be the set of matrix pairs which are equivalent to some 
pair in £ (recall that A = (A 0 , A\) and A! = (A' 0 , A\) are equivalent if A' 0 = uP~ 1 AqP 
and Aj = vP~ 1 AiP for some invertible P and u,v > 0) then clearly: 

Corollary 1. The open subset (£ C M 2 (M ) 2 is such that if A = (A 0 , Af) G (H then for 
uncountably many t G M + , the matrix pair (A 0 ,tAi) is a finiteness counterexample. 

Remark 1. Theorem [T] yields new finiteness counterexamples, in the sense that £ 
contains matrix pairs which are not equivalent to pairs satisfying (13|) . (I4j) . or (l5]h To 
see this, note for example that 

belongs to £. Both Ao and A\ have their larger eigenvalue equal to 1, and smaller 
eigenvalues given by Ao = 9/16 and Ai = 13/16, respectively. Now both matrices in 
(|5D have the single eigenvalue 1, so (JSD cannot be equivalent to (J5]); moreover (J9J) is 
not equivalent to any matrix pair satisfying (J3J), since both matrices in (HD have the 
property that the larger eigenvalue is more than double the smaller eigenvalue. Lastly, 
we show that A in (jUJ) is not equivalent to any pair A' satisfying (j3|, i.e. A 1 = (Aq, A \) 

A\ = ^ ^, where 0 < a, d < 1 < be. Note that both Aq 

and A[ have their larger eigenvalue equal to 1 (as is the case for Ao and Ai), and 
smaller eigenvalues equal to cl and a, respectively. Thus if A and A' were equivalent 
then there would exist an invertible P such that Aq = P~ 1 AqP and A' x = P~ l A\P 
(i.e. the positive reals u,v in the above definition of equivalence must both equal 1 ), so 
that d = A 0 = 9/16, a = Ai = 13/16, and trace(A 0 A!) = trace^^A^). In particular, 
1 < be = trace(AgA , 1 ) — d — a = trace(A 0 A!) — A 0 — Ai = 12995/14336 < 1, a 
contradiction. It follows that A G £ given by ([9]) is not equivalent to any matrix pair 
satisfying (J4D. 

A key tool in proving Theorem |T] is the following Theorem [2] (proved in IJHas Theorem 
[4|) characterising the joint spectral radius of A G £ in terms of maximizing the integral 
of a certain function /a over the set of probability measures invariant under an 
associated mapping T 4 . More precisely, the action of any positive matrix A on (M + ) 2 
induces a projective map Ta (see H2.ll) . and if A = (A 0 , Af) G € then the inverses T^ 1 , 
T^ together define a two-branch dynamical system T 4 (see 1J4D on a subset of the unit 


with Ag = 


1 0 

c d 
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interval X. Defining the real-valued function /_ 4, in terms of the derivative and 
characteristic functions of the images T 4 0 (A") and Ta 1 (X), by 

f^ = \ ( 1 o S t a + (l°gdet AL 0 ) 1 Tao(X ) + (logdeti4i)l TAi( x)) 

then gives: 

Theorem 2 . If A E € then 

logr(*4.) = max 
/xGMa 

I 11 order to state a more precise version of Theorem [Q we first need some basic facts 
concerning ergodic theory, symbolic dynamics, and Sturmian measures: 

Definition 4. Let D = {0,1} N denote the set of one-sided sequences lu = (tu n )^L 1 , 
where c o n E {0,1} for all n > 1. When equipped with the product topology, D becomes 
a compact space, and the shift map a : D —>• D defined by (crcu) n = Cc i n +i for all n > 1 
is then continuous. Let M. denote the set of shift-invariant Borel probability measures 
on D; when equipped with the weak-* topology Xi is compact (see [23] Thm. 6.10]). 

We equip D with the lexicographic order <, and write [a; - , cu + ] = (w 6 O : co~ < 00 < 
cu + }. A Sturmian interval is one of the form [Ocu, la;], for some a > E fl, where a/ = acu 
(a cr-preimage of c 0 ) is given by a;] = a and u>' n = uj n -1 for n > 1. A measure n E M 
is called Sturmian (see e.g. [5, Prop 1.5], [6]) if its support is contained in a Sturmian 
interval. Let S C A4 denote the class of Sturmian measures on D. For a Sturmian 
measure fi E S, the value /x( [1]), denoted is called its (Sturmian) paramete r@, 

where [1] denotes the (cylinder) set {w E D : = 1}. 

The following are classical facts about Sturmian measures (see e.g. [5] § 1 . 1 ] or P|): 

Proposition 1. 

(a) For each Sturmian interval [Oo;, la;] C D there exists a unique Sturmian measure 
whose support is contained in this interval. 

(b) The mapping V : S —* [0,1] is a homeomorphism. If pi E S has V(gi) E Q then 
its support is a single cr-periodic orbit, while ifV(p) ^ Q then its support is a Cantor 
subset of Q which supports no other a-invariant measure (and in particular contains 
no periodic orbit). 

(c) If d{uj) denotes the Sturmian parameter of the Sturmian measure supported by the 
Sturmian interval [Ocu, leu] C D, then the map d : D —> [0,1] is continuous, non¬ 
decreasing, and surjective. The preimage d _ 1 ('P) is a singleton if V is irrational, and 
a positive-length closed interval if V is rational. 

For example the Sturmian measures of parameter 1/2, 1/3, 2/5, 3/8 and 5/13 are, 
respectively, supported by the cr-periodic orbits generated by the finite words 

01 , 001 , 00101 , 00100101 , 0010010100101 , 

whereas the Sturmian measure of parameter (3 — \/ 5)/2 is supported by the smallest 
Cantor set containing the cr-orbit of 

0010010100100101001010010010100101 ... 

3 This corresponds to the 1-frequency mentioned in S3 sometimes called the 1-ratio (see e.g. 0 
ED, or the rotation number (see e.g. 0). 
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In view of Theorem [2l for a matrix pair A G £ we are interested in measures v G M.a 
attaining the maximum in (TTU|> . i.e. satisfying J f^dv = max /lGMA f f^dfjb) such u will 
be called / ^-maximizing. There is a topological conjugacy between T 4 and the shift 
map u : hi —y f 2 , and this induces a natural homeomorphism between and A4; 
the image of any /^-maximizing measure under this homeomorphism will be called 
a maximizing measure for A. We then say that A = (A 0 , Ai) G £ generates a full 
Sturmian family if the set of maximizing measures for the family Aft) = (Ao,tAi), 
t G M + , is precisely the set S of all Sturmian measures on hi. 

A more precise version of our main result Theorem |T] is then the following: 

Theorem 3. Every matrix pair in the open subset D C M 2 (M ) 2 (and hence the open 
subset <£ C M 2 (M) 2 j generates a full Sturmian family. 

Note that Theorem [H] will follow from a more detailed version, Theorem [HI which in 
particular incorporates the statement that the parameter map t 1 — y Vft) is a devil’s 
staircase. 

1.3. Relation with previous results. The methods of this paper can also be used to 
give an alternative proof of some of the results mentioned above, namely establishing 
the analogue of Theorem [3] in certain cases treated by Bousch & Mairesse [5] and 
Kozyakin [13], and the case considered by Blondel, Tlieys & Vladimirov [2], Hare, 
Morris, Sidorov & Theys [9], and Morris & Sidorov ini- 

As already noted, the matrix pair ([5]) lies on the boundary of our open set D, and 
clearly it also lies on the boundary of the set A C M 2 I "(M + ) 2 defined by Kozyakin’s 
conditions ([3]). It can be checked that & itself lies in the boundary of our set £, but 
not in the boundary of D. However, the subset A! C A defined by 



( 11 ) 


can be readily checked to lie in the boundary of D. Matrices in the Bousch-Mairesse 
family (J3J) do not all satisfy our condition (JH]), or indeed the corresponding weak in¬ 
equality, so do not automatically belong to the boundary of €. However, imposing the 
additional condition 


6 2k > (gisho + !)( e ^i + 1) 


( 12 ) 


ensures that a matrix pair satisfying ((3J) belongs to the boundary of £, and indeed also 
belongs to the boundary of D. In H7.2I we will indicate the minor modifications to our 
approach needed to handle the case of (J3J), and the sub-cases of (J2D and ([3]) defined by 
(IT2|) and (fill) respectively. 

1.4. Organisation of article. The article is organised as follows. Section [2] consists of 
preliminaries: maps induced by matrices acting on projective space, Perron-Frobenius 
theory, and some useful notation and identities. Section [3] develops the notions of pro¬ 
jective convexity and projective concavity. Section [3] introduces the induced dynamical 
system T 4 for concave-convex matrix pairs A , and the formulation of joint spectral 
radius in terms of ergodic optimization (Theorem [4]) . Section [5] introduces Sturmian 
measures and Sturmian intervals for the dynamical system T 4 , while Section [ 6 ] estab¬ 
lishes the existence of an important technical tool, the Sturmian transfer function. 
After deriving some explicit formulae for extremal Sturmian intervals in Section [TJ. the 
key Section [ 8 ] establishes the link between Sturmian intervals and the parameter t of 
the pair Aft). Section O treats the case of those parameters t such that one matrix in 
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the pair A{t) dominates the other, so that the joint spectral radius r(A(t)) is simply 
the spectral radius of the dominating matrix. All other parameters are considered in 
Section [TUI establishing that the joint spectral radius is always attained by a unique 
Sturmian measure. Finally, in Section [TT] we show that the map taking parameter 
values t to the associated Sturmian parameter V(t) is a devil’s staircase. 


2. Preliminaries 

2.1. The induced map for a positive matrix. 

Notation 2. Throughout we use the notation X = [0, 1]. 

A positive matrix A G M 2 (M + ) gives a self-map v ha Av of (M + ) 2 . This lifts to a 
self-map A : [u] ha [Av] of projective space (M + ) 2 / ~, the equivalence relation ~ being 
defined by v ~ v' if v = sv' for some s > 0, and [u] denoting the equivalence class 
containing v G (M + ) 2 . It is convenient to identify projective space with 


E = 


x 

1 — X 


: x G (0,1) 


so that the projection n : 


2 —> E takes the form 

/ X \ 


7T : 


HA 


/ x+y ] 

l -K- j ’ 

\x+y / 


and the projective map is represented as n o A : E —> E, taking the explicit form 

( a—b)x+b 


71 O A 


X 

1 — X 


HA 


(a+c-b—d)x+b-\-d 
(c—d)x+d 


k (a-\-c—b—d)x+b-\-ds 

This projective mapping is completely determined by its first coordinate, thereby mo¬ 
tivating the following definition of the self-map Ta of the unit interval X = [0,1]: 


Definition 5. For A = 

by 


a b 
c d 


T a (x) = 


2 ), the induced map Ta : X —> X is defined 

(a — b)x + b 


the induced image Xa is defined by 

Abi = T a (X) = 


(a + c — b — d)x + b + d ’ 
b a 


b + d' a + c 

and the induced inverse map S A ■ X A —> X is given by 

(b + d)x-b 


S A (x) = Tx\x) = 


-(a + c — b — d)x + a — b 


Remark 2. Defining P = 

respectively, by the matrices 

PAP' 1 = 


1 0 
1 1 


, the Mobius maps T A and S A are represented, 


a — b b 

a + c — b — d b + d 


and P l AP = 


b + d —b 
b + d — a — c a — b 










O. JENKINSON & M. POLLICOTT 


Remark 3. The objects defined in Definition [5] do not change if the matrix A is 
multiplied by a positive real number; that is, if t > 0, A E M 2 i "(M + ), then T tA = T A 
(hence S tA = S A ), and X tA = X A . 


Lemma 1. For A = 


c d 


E M 2 t_ (M + ), the map T A has a single fixed point p A = 


T a (pa ) m X, given by the formula 


Pa = 


d — 2b + \/ (a — d ) 2 + 46c 
2 (a + c — b — d) 


(13) 


Proof. Uniqueness follows from the fact that A has all entries strictly positive, and the 
formula (TT3T) is a straightforward computation. □ 


2.2. Notation and matrix preliminaries. For a matrix A 
it will be useful to write 



e m+(k+), 


a a = a + c — b — d , 

Pa = cl — d — 26 , 

7 a = y/(a - d) 2 + 4 be, 

noting that these quantities are related by the following identity: 
Lemma 2. For A E M 2 t "(M + ), 


(14) 

(15) 

(16) 


7 Pa = iba * ■ 

Proof. Straightforward computation. 


(17) 

□ 


For ease of reference it will be convenient to collect together various previously 
defined objects expressed in terms of the above notation. 

Proposition 2. For A E Mf(R + ), 


Qa — 


a A = 
Pa + 7a 

PA = - 


2b 

Pa + 7 a 
b — a 


ola 


bo a 


2a A (b — a)g A ’ 

aAX + b + d 

(b + d)x-b 
Sa{X ) = ---;-r , 


(18) 

(19) 

( 20 ) 


£ 


(Ao,A!) eM+(M H 


c 0 d, 
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2.3. Some useful formulae. The purpose of this short subsection is to collect to¬ 
gether various formulae which will prove useful in the sequel. Firstly, we have the 
following two expressions for the determinant of A involving a a and a a- 

Lemma 3. For A G (M), its determinant can be expressed as 

a 


and 


det A = —ct^a + c) 


det A = —aA(b + d) 


a + c 
b 

b T d 


+ a 


+ <74 


Proof. Straightforward computation. 

There is a useful alternative way of expressing the quantity qa- 
Lemma 4. For A G M 2 l “(M + ) 7 


Qa — 


7 a ~ Pa 
2 a A 


and qa is the larger root of the quadratic polynomial qa defined by 

q A (z) = a A z 2 + (3 a z -b. 

Proof. The expression ([23]) follows from (fl 8 |) and the identity (ITT]) . 
The larger root of qa is computed to be 

ft a + \J Pa + 4 _ ~Pa + 7 a 


2a A 


2 OLA 


again using 03- 
Clearly 


q A {z) = det 


1 z 

—claz Paz — bl ’ 
though the following expression will prove to be more useful: 

Lemma 5. For A G M 2 I “(M + ) , 


q A {z) = det ( , , 1 / A , ] . 

\b + d — <aaz (a — b)z — bj 


( 21 ) 

( 22 ) 

□ 

(23) 

(24) 


□ 


(25) 


□ 


Proof. Straightforward computation. 

2.4. Perron-Frobenius theory and the joint spectral radius. 

Lemma 6. The dominant (Perron-Frobenius) eigenvalue Xa > 0 of the matrix A = 
° c G M+(M+) is given by 

Xa = ~ (a + d + 7 a) — -h a — b, 

2 pa 

with corresponding left eigenvector 

w A = ( a - d + 7 2 b ), 

and right eigenvector 

PA 


v A = 


1 ~ Pa 
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The derivative of T 4 at its fixed point p a is related to the determinant and Perron- 
Frobenius eigenvalue of A as follows: 


Lemma 7. If A G M 2 t "(M + ) then 

T' a (Pa) 


det A 
A 


Proof. Straightforward computation using the expression pa = (see (120])), the de¬ 

rivative formula T\ (a;) = det A (a a'X + b + d)~ 2 , and the fact that Xa = f (a + d + 74 ) = 
\{Pa + 7 a ) +b + d (see (ED). ' □ 


Since the Perron-Frobenius eigenvalue is also the spectral radius r{A), we obtain 
the following corollary: 


Corollary 2. If A G M 2 I "(M + ) then its spectral radius r(A) satisfies 


r(A) = 

Proof. Immediate from Lemma [71 


det A 


T'Apa )') 


1/2 


(26) 

□ 


Notation 3. Let us write finite words using the alphabet {0,1} as i = (ii,..., i n ), 
and their length as |i| = n. Let hi* denote the set of all such finite words; that is, 
O* = U n >i{0, l} n . Given A = (A), Af) G M 2 (M) 2 , and i G {0, l} n , let A[i) denote the 
product 

A(i)=A h ---A^. (27) 

Corollary [2] then allows us to express the joint spectral radius of a matrix paii0 
A = (A 0 , Ai) G M 2 i_ (M + ) 2 in terms of induced maps of the products A(i) as follows: 


Proposition 3. If A 

satisfies 


(A, A) £ ^ / h 2 l "(M + ) 2 , then its joint spectral radius r(A) 


r(A) = sup 

i£fl* 


det A(i) 

T A(i)(PA(i)) 


i/2|*| 


(28) 


Proof. The expression (J 2 J) for the joint spectral radius can be written as 

r(A) = sup {r(Ai • ■ • A in ) l/n : n > l,ij G {0,1}} = sup r(A(i )) 1/l - 1 , 

ien* 


so applying Corollary [ 2 ] with A replaced by A(i) yields the result. 


□ 


3. Projective convexity and projective concavity 


Remark 4. 

(a) The derivative formula 

Ta( x ) = det A (oiAX + b + d)~ 2 (29) 

implies that if A G M 2 f (M + ) 2 then Ta 0 and Ta 1 are orientation preserving. 

4 There is an obvious analogous formula for more general finite sets A. 
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(b) The second derivative formula 

T'a(x) = — 2a a det A [oiax + b + d)~ 3 (30) 

implies that if A G £ then T A < 0 and T Ai > 0, i.e. Ta 0 is strictly concave and Ta 1 is 
strictly convex. 

Part (b) of Remark 0] motivates the following definition, partitioning M 2 I "(M + ) into 
two subsets: 

Definition 6. A matrix A G M 2 I “(M + ) will be called projectively convex if the induced 
map Ta is strictly convex, and projectively concave if the induced map Ta is strictly 
concave. 


Remark 5. The set M^(R + ) is the disjoint union of the subset of projectively convex 
matrices and the subset of projectively concave matrices. 


Recall that 


w A = {w ( £,w^) = {a-d + 'y A ,2b) (31) 

denotes the Perron-Frobenius left eigenvector of A G M^(R + ), and that (consequently) 


the right eigenvector for the other eigenvalue of A is 
the following identity: 

Lemma 8. For A G M^(R + ), 


VJ 


( 2 ) 


-W 


( 1 ) 


It is useful to record 


Qa 





Proof. Immediate from (fl 8 l) and (j3lil . 


(32) 

□ 


Corollary 3. For A G M 2 I "(R + ), if Q G M 2 ( R) is non-singular then Qq-^aq — Qa', that 
is, qa is invariant under similarities. 


Proof. Immediate from Lemma [HI and the fact that the eigenvector wa is invariant 
under similarities. □ 


There are various useful characterisations of projective convexity and projective con¬ 
cavity: 

Lemma 9. For A G M^(R + ) ; the following are equivalent 

(i) A is projectively concave, 

(ii) a A > 0 , 

(iii) Qa > 0, 

(iv) w A > w A . 

Proof. As noted in Remark 0] (b), the second derivative formula (1301) yields the equiv¬ 
alence of (i) and (ii), since det A > 0 , and a function is strictly concave if and only if 
its second derivative is strictly negative. 

To prove the equivalence of (ii) and (iii), we consider separately the cases where 
/3a > 0 and [3 a < 0 . If /3a > 0 then a a = /3a + b + c > 0 , so we must simply show 
that Qa > 0 . But 7 a > 0 by definition, hence /3a + 7a > 0, and therefore (fTSD implies 
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that qa = ;j4 + 7A > 0, as required. If on the other hand /3a < 0 then 7 a [3a > 0 is 
automatically true, again since 7 a > 0 by definition. Using and (fTHjl we see that 

2 a A QA = 1 a ~ (3a > 0 , 

so indeed a a > 0 if and only if qa > 0, as required. 

/ o\ 

Lastly, the equivalence of (iii) and (iv) is immediate from (132|) . since w' A >0. □ 

Lemma 10. For A G the followmg are equivalent 

(i) A is projectively convex, 

(ii) a A < 0, 

(iii) q a < -1, 

c \ (i) ^ (2) 

(iv) w A < vj a . 


Proof. A function is strictly convex if and only if its second derivative is strictly positive, 
so the equivalence of (i) and (ii) follows from (130|) . since det A > 0 and + b + d = 
a + c + {b + d)(l — x) > 0 for all x G A". 

To prove that (iii) is equivalent to (iv), note that (1321) gives w^ A = w A \ 1 + Q~a)] 
therefore qa < —1 if and only if 1 + g A x G (0,1), if and only if w A 1 G (0,w^' ) ). 

Lastly, to prove the equivalence of (ii) and (iii), it follows from Lemma [9] that a a < 0 
if and only if qa < 0, but this latter inequality in fact implies vf A — w A ^ > 0 by (1321) . 
so 


Qa 


— 1 - 





< - 1 , 


as required. □ 

Note that in Lemma flOl the assertion is not merely that qa < 0, but that qa < — 1; 
this should be contrasted with the inequality qa > 0 in Lemma [9j 
It is now clear why € is described as the set of concave-convex pair 


Proposition 4. The set € consists of those matrix pairs (Aq, A\) G M 2 f (M + ) 2 such 
that Aq is projectively concave, A\ is projectively convex, and the induced image for A 0 
is strictly to the left of the induced image of Ai. 


Proof. Lemmas |9] and [TO] imply that the inequality < 0 < oa 0 in Definition [Tj is 
equivalent to Aq being projectively concave and A\ being projectively convex. The 
inequality ^ in Definition Q] is equivalent to T Ao ( 1) = = p Al { 0), 

which asserts that the right endpoint of the induced image Xa 0 is strictly to the left 
of the left endpoint of the induced image X Al - □ 

Lemma 11. If A G M^(M + ) is projectively concave then a a < 0. 


Proof. Projective concavity of A means that a a > 0, so by (1T91) it suffices to show that 
b < a. Since det A = ad — be > 0 and a A = a + c — b — d > Owe derive 


, , be 

a — b > d — c > — 
a 


c = - (a 

a 


b), 


or in other words 

(a — b ) ^1 H—> 0 , 
and hence a — b > 0 , as required. 


□ 


5 Note, however, the restriction that the induced images be disjoint, with the concave image to the 
left of the convex one. 







JOINT SPECTRAL RADIUS, STURMIAN MEASURES, FINITENESS CONJECTURE 


13 


We can now prove the following result mentioned in §1,21 (note, however, that there 
is no constraint on the sign of ga x when (A 0 , Ax) G £): 

Corollary 4. If (A 0 , Af) G £ then ga 0 < 0 < qa 0 and qa x < — 1. 

Proof. Immediate from Lemmas [9] [TQ1 and |TT] □ 

An important result is the following: 

Lemma 12. If A G M 2 f (M + ) then 

— a a (x + a a) > 0 for all x G Xa ■ (33) 

In particular, if A G M 2 I_ (M + ) is projectively concave then 

x + a a < 0 for all x G Xa , (34) 

and if A G M^(M + ) is projectively convex then 

x + a a > 0 for all x G Xa ■ (35) 

Proof. Clearly fl33l) follows from (1341) and (135|) . since a a is positive if A is projectively 
concave, and negative if A is projectively convex, by Lemmas I9l and flOl 

To prove (l34j) . note that — 0^(0 + c) + a a) = det A > 0 by (l 2 Tj) . and if A is 
projectively concave then a a > 0, so -ffj- + a a < 0. But ^ is the righthand endpoint 
of Xa, so if x G Xa then x < therefore x + a a < ^ + a a < 0, as required. 

To prove (j35l) . note that — «a(& + d ) + ga) = det A > 0 by (1221) . and if A is 

projectively convex then a a < 0, so + cr A > 0. But ^ is the lefthand endpoint of 
Xa, so if x G Xa then x > therefore x + a a > ^ + a a > 0, as required. □ 

Corollary 5. If (A 0 , A\) G £ then x + cr^ < 0 for x G Xa 0 , and x + typ > 0 for 
x G Xa 1} and — 0 ^( 0 : + cr^J > 0 for all x G Xa.,, i G {0,1}. 

Proof. Immediate from Lemma fl2l □ 

Lemma 13. If A = (A 0 , Ai) G € then 

Ui(Qa 0 ) < 0 < q Ao (QA J • 

Proof. The larger root of qa x is Qa x , by Lemma [H It follows that qA x (^) = ola x z 2 + 
/3a x z — bi < 0 for all z > qa x , since the leading coefficient < 0 , since Ai is 
projectively convex. But by Lemmas IU1 and [TUI we know that Qa 0 > 0 > —1 > Qa x , so 
indeed qA^dAo) < 0 , as required. 

The smaller root of qA 0 , which we shall denote by r^ 0 , is given by 

— (7A 0 + Pm) 


It follows that 

qA 0 (z) = a Ao z 2 + /3 Ao z -b 0 > 0 for all z < r Ao , (36) 

since the leading coefficient «a 0 > 0? since A 0 is projectively concave. Now qa x < — 1 
by Lemma flOl and if we can show that ta 0 > — 1 then it follows that Qa, < va 0 , and 
hence qA 0 {qa x ) > 0 by ([36]). 

To show that indeed ta 0 > —1, note that this inequality is equivalent to 2<aa — (3a > 
7 a- Both sides are positive, so this is equivalent to (2 a a — (3a) 2 > 1a, which using (flTl) 
becomes 4 «a(«a — (3a) > AbctA ■ This latter inequality is equivalent to (Xa — (3a > b, 
which is true because in fact a a — (3 a = b + c > b. □ 
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We deduce the following technical lemma, which will be used in 1J9) 

Lemma 14. If A G € then the Mobius function 

x + Qa 0 

ijq |_ y _ 

(bi + di- a Al Qa 0 )x + (ai - bf)g Ao - h 
has strictly negative derivative, while the Mobius function 

x + q a i 

rjQ |_ y _ 

(bo + do — a Ao g Al )x + (ao — bo)g Al — bo 
has strictly positive derivative. 

Proof A general Mobius map x H» ''f '(( has derivative D(Rx + S)~ 2 , where D = 
PS — QR, so the derivative is strictly negative if D < 0, and strictly positive if D > 0. 
For our first Mobius map we have 

1 Qa 0 

K b i + d\ — a Al g Ao (ai — b\)g Ao — b\^ 

by Lemma El and q Al (g Ao ) is strictly negative by Lemma fT51 so the derivative of the 
map is strictly negative, as required. 

For our second Mobius map we have 


D = det 


— q Al (£u 0 ) 


D = det 


bo + dc 


~a Ao g Al (a 0 -b 0 )g Al -bo) 


by Lemma El and q Ao (g Al ) is strictly positive by Lemma 
map is strictly positive, as required. 


so the derivative of the 

□ 


4. The induced dynamical system for a concave-convex matrix pair 
4.1. The induced dynamical system and joint spectral radius. 

Definition 7. For a matrix pair A = (Ao, Af) G £, define the induced space X A to be 

x A = x Ao u X Al , 

and define the induced dynamical system T A : X A -A X by 

S A o(x) if xe X Ao 
S Al (x) if x e X Al • 


Ta(x) = 


Remark 6. 

(a) The map T A : X A -A- X is Lipschitz continuous since X A = X Ao U X Al is the 
union of disjoint intervals X Aq and X Al , and the restriction of T A to X Aj is the Mobius 
mapping S Ai , which is certainly Lipschitz continuous. 

(b) Note that the (surjective) induced dynamical system T A is naturally defined as a 
mapping from X A to X = [0,1]. To view it as a surjective se//-mapping of some set 
(the natural setting for a dynamical system) we consider its restriction to the induced 
Cantor set Y A := r\ n >oT^ n (X A ), and note that T A : Y A -A Y A is topologically conjugate 
to the shift map a on Q — {0,1} N . 

Proposition 5. If A — (A 0 , A\) G € then its joint spectral radius r(A) satisfies 


r(A) = sup (detA(f) (T l j)'(p A ^) 

iefi* ' 


1/2111 


( 37 ) 
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Proof. From Definition [7|we see that T 4 o T Ai is the identity map on X, for i G {0,1}, 
since T 4 is defined in terms of the inverses S Ai = T/. 1 - Similarly, for any f G {0, l} n 
we see that o Ta(j, is also the identity map on X, so (TJJ)'(TA(p(^)) T'^Jx) = 1 for 
all a; G A", by the chain rule. Setting x = p A ($ = TAU)(j>A(i)) we obtain 


(^)'(PA©) 


1 

T A(i)(PA{i )) ’ 


and combining this with f|28|) gives the required formula (13?|) . 


□ 


4.2. Invariant measures for the induced dynamical system. 

Definition 8. For A G (£, let A4^ denote the set of TAinvariant Borel probabil¬ 
ity measures on A" = [ 0 , 1 ]; the support of any such measure is contained in Ya = 
n n > 0 Ta u {x a ). 

Remark 7. The topological conjugacy — > Ya between the shift map a : Q — > O 

and T a '■ Ya — >• Ya (cf. Remark [ 6 ] (b)) induces a one-to-one correspondence h* A : M. —* 
M.a between invariant measures. 


Definition 9. For a bounded Borel function / : Xa — » R, a measure m G M.a is 
called / -maximizing if 

/ / dm = sup / f dp . 

/xEA'I.a J 

In the generality of Definition [UJ the notion of an /-maximizing invariant measure is 
part of the wider field of so-called ergodic optimization, see e.g. m- 

Definition 10. For A = ( A 0 , A) G £, define the induced function fA '■ A \ —> R by 


That is, 


fA= 2 ( lo S r A + (log det A 0 )1 Xaq + (logdet A)1 .y A i ) 

f \ (log S' Aq (x) + log det A 0 ) if x G X Aq 
H (log S' Al (x) + log detail) if x G X Al , 


(38) 


(39) 


so writing ^ = 


62-2 


gives 


Ci di 

f A (x) = log 


det A 


+ cr Ai ) 

where we recall from Corollary [5] that — detAi 


-a A , (x+a A , ) 


for x G X A ., 

> 0 for all x G X A ■ 


(40) 


Remark 8. The function [a is clearly Lipschitz continuous on each X At , hence Lips- 
cliitz continuous on Xa = X Ao U X Al , since the intervals X Ao and X A] are disjoint. 

The reason for introducing the function fA is provided by the following characteri¬ 
sation of the joint spectral radius in terms of ergodic optimization: 

Theorem 4. If A = (A, A) £ <£ then its joint spectral radius r(A) satisfies 


logr(A) = max / f A dp. 

J 


( 41 ) 
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Proof. From Proposition [5] we have 


logr(A) = sup log (deti4(i) {T^)'{p A (i)) 


1/2111 


If i e {0, l} n then 

log (det A(i) (T%)\pA(i))) 1/2llJ = ^ (logdet (A h ■ ■ ■ A in ) + log(T;()'(p A(i) )) 


n —1 


— ( !°g II det At. + log Yl T' A {T 3 A (p Aii) )) 


3 = 1 


3=0 


1 ^ 1 


- o ( lo S det A k+ 1 + l °sT A (Tf(p A& ))) 


n 2 

3=0 
n—1 


-£yup>*cP). 


t=o 


where the last step uses Q3SJ) together with the fact that 

log det A ij+1 = ((log det Ao) lx Ao + (logdet Ai) 1 Xai ) (T J A (p A(i) )), 

because 

1 if ij .|_i = k 


Combining 


1 x Ak (T\(p m )) = 

and (1431) gives 

III— 1 


0 if ij .|_i 7 ^ k . 


where 


lil-i 


lH -~ \i\ ^2 S T 3 A (PA(i)) 
l-l j =0 


(42) 


(43) 


logr(*4) = sup pr V fA(T^p A{i) )) = sup f f A dp k , (44) 

iEfi* 11.| q iEO* J 


is the unique measure in A4 a whose support equals the periodic orbit {T A (p A ^)}^ Q l . 
By a result of Parthasarathy pT9] (see also Sigmund eh). the collection of periodic 
orbit measures {ji,_ : i G fl*} is weak* dense in the weak* compact space M. Al so 


sup / fA df-k = max / f A dp, 


ien* 




and combining with (ITT]) gives the required equality (|4Tj) . 


□ 


Remark 9. If A G € has the finiteness property, and i e { 0 , l} n is such that r(A) = 
r(A(z)) 1 / n , then the corresponding periodic orbit measure 

1 lil-i 

= u[ S Tf( PA{i) ) e M a 
i-i j =o 

is /^-maximizing, i.e. it attains the maximum in (HI]) : logr(A) = max^jn^, f f A dp = 
f fA dpi- If on the other hand A G £ does not have the finiteness property then, 
although at least one v e A i A will be /^-maximizing, such a measure will not be 
supported on a single periodic orbit. Nevertheless, v may be chosen to be an ergodic 
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measure (since it is readily shown that the set of /^-maximizing measures is convex, 
and any of its extremal points is ergodic), so that the ergodic theorem (see e.g. [23, 
Thm. 1.14]) implies the existence of p G Y A such that 


n—1 


logr(A) = / f A dv = lim f A (T 3 A (p)) 

I n—>-oo u zJ 


3=0 

The topological conjugacy h A : O —> Y A identifies p with a sequence u> = h' A {p) G fl, 
with the property that 

r(A) = lim r(A Ul ■ ■ ■ A Uri ) 1/n , 

n —>-oo 

and the image measure m = {hf^)*{v) (a maximizing measure for A in the terminology 
of (HJ satisfies 


g A dm = max / g A dl, 
leM I 


where g A := f A °h A . 


4.3. Monotonicity properties and formulae. The following simple lemma records 
that for A G £, the induced dynamical system T A ^ is independent of t, and that the 
induced function f A ^ differs from f A only by the addition of a scalar multiple of the 
characteristic function for the image X Al . 

Lemma 15. For A = (d 0 , A{) G €, and all t > 0, 

(i) T A u) = T a , 

(ii) f A (t ) = f A + (logt)l XAi , 

(iii) f A (t){T A 0 (1)) - f A (t)(T Al ( 0)) = f A (T A 0 (1)) - f A (T Al ( 0)) - logt, 

( iv ) f A (t) = &> mth 

f A ( x ) = -( x + cr A i )~ 1 for x G X A ., i G {0,1} . (45) 

Proof, (i) From Remark [3] we see that if t > 0 then T tAl = T Al , hence T A m = T A . 

(ii) Formula (139|) gives f A = f A ( t ) on X Aoi while for x G X Al we have 

f A (t) (x) = ^ (log S' Al (x) + log det tAi) = log t + f A (x) 

since logdett^! = log (t 2 det Af) = 2 log t + log det A \, thus f A g_) = f A + (logt)lx A • 

(iii) This is immediate from part (ii). 

(iv) The formula for f A follows readily from the explicit formula (140]) for f Al and is 

equal to f' A(t: . by (ii) above. □ 

Lemma 16. If A = (Aq,Ai) G C then 

(i) f A is strictly positive on X Ao and strictly negative on X Al , 

(ii) f A is strictly increasing on X Ao and strictly decreasing on X Al , 

(iii) (f A ° T A J{ X ) > 0 and (f A o T l Ai )'(x) < 0 for all i > 1, x e X. 

Proof, (i) In view of formula (l45lh it suffices to note that by Corollary [5] x + cr Ao < 0 
for x G X Ao , and x + a Al > 0 for x G X Al . 

(ii) This is an immediate consequence of (ii). 

(iii) By the chain rule, 

{/a o «/(*) - fjfiMMtXn*) ■ 


(46) 
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The second factor (T\ )'(x) on the righthand side of (1461) is strictly positive for all 
x G X, i > 1, j G {0,1}, since TA . is orientation-preserving, as noted in Remark [2 
Regarding the sign of the hrst factor f\{T\.{x)) on the righthand side of (146|) . note 
that since i > 1 then T\ (x) G = T Aj(X) for all x G X. Part (i) above then implies 
that f^{T\.(x)) is strictly positive when j = 0 and strictly negative when j = 1. It 
follows that (/_4 o T\)'(x) is strictly positive when j = 0 and strictly negative when 
j = 1 , as required. □ 


For the purposes of the following Lemma [T7I it will be convenient to introduce the 
following notation: 


Notation 4. For a matrix A = 

5 A = 


a b 
c 

b T d 


d j G M+(R+), define 
b T d 


a a a + c — b — d 

We can now give another characterisation of qa- 
Lemma 17. For A G M 2 I "(M + ), 

qa = lim 5 A k . 

k—> oo 


Proof. Perron-Frobenius theory (see e.g. [23l Thm. 0.17]) gives 


lim A A k A k = vwa = 

k—> oo 




) 


where the positive dominant eigenvalue > 0 and corresponding left eigenvector wa 
are as in Lemma [HI and v is a corresponding right eigenvector (a suitable multiple of 
Va from Lemma El), normalised so that Wav = 1 . 

It follows that 


lim 5a fc 

k —^oo 






w 


( 2 ) 


W 


( 1 ) 


— U! 


( 2 ) > 
A 


(47) 


so the formula Qa = m A m from Lemma [S] concludes the proof. 


□ 


Corollary 6. For A G M 2 + (M + ) 7 


£(io g s>r;)'0r) 

n =1 


2 

x + Qa 


(48) 


Proof. A simple calculation using the chain rule yields 

k 

£(logS' o T2Y(x) = -(log r At )\x) = —G- (49) 

n= 1 A 

for all k > 1, so letting k —> oo we see that the result follows from Lemma [171 □ 

Recalling from (l39j) that J’a — 17 (log S' A . + logdet Afj on Xa v the following result is 
an immediate consequence of Corollary EJ 
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Corollary 7. If A = (A 0 ,Ai) G M 2 I "(M + ) 2 then for i G {0,1}, 

OO 1 

E(U° T i)'U) = -r- forxeX A ,. (50) 

IS I + 

Corollary 8. If A = (A 0 , Ai) G £ then for all x G A", 

OO OO 

E<U°7X)'U)>0 and ^(IaoTaJMcO, (51) 

n=l n=l 

and 

x + ^A 0 > 0 and x + qa x < 0 . (52) 

Proof. The inequalities in (|5lT) follow from Lemma fl6l (iii). while (1521) is an immediate 
consequence of (1501) and (|5TT) . □ 


Remark 10. The inequality x + qa 0 > 0 in (|52l) can also be deduced from the fact 
that qa 0 > 0 (by Corollary BJ and x > 0. 

5. STURMIAN MEASURES ASSOCIATED TO A CONCAVE-CONVEX MATRIX PAIR 

For A G £, the induced space A a becomes an ordered set when equipped with the 
usual order on X = [0,1]. In particular, by a sub-interval of Xa we mean any subset 
of A _4 of the form I n Xa where / is some sub-interval of A". Note that a sub-interval 
of Xa is a sub-interval of A" if it is contained in either A"a 0 or Xa 1 ', otherwise it is a 
union of two disjoint intervals in A". 

Definition 11. Given a matrix pair A G £, a closed interval T C Xa is called A- 
Sturmian (or simply Sturmian ) if T^minT) = ^(maxT), i.e. its two endpoints minT 
and rnaxT have the same image under the induced dynamical system t a - 

Remark 11. 

(a) The topological conjugacy Iia '■ ^ —* Ya (of- Remark [TJ) is order preserving, so if 
T C A _4 is an *4-Sturmian interval, then (T fi Ya) is a Sturmian interval as defined 
in Definition H] (i.e. of the form [Ocu, leu] for some w G h). 

(b) For all t > 0, an interval is .4-Sturmian if and only if it is *4.(f)-Sturmian. 

Definition 12. Let Xa denote the collection of all yf-Sturmian intervals. Note that 
I A is naturally parametrized by X = [0,1]: for each c G X there is a unique T G Xa 
such that T^(minr) = T^max T) = c. Henceforth we shall write ca(T) to denote the 
common value T^min T) = ^(max T) for an ^4-Sturmian interval T G X 4 , noting that 

C A : 2U ~t A" 

is a bijection. As a subset of A, we can express T G Xa as 

r = [T*(c^(r)),T*(i)i u (TX.( 0 ), TVMr))]. ( 53 ) 

Remark 12. It is apparent from (|53l) that, viewed as a subset of X = [0,1], an A- 
Sturmian interval T is always a disjoint union of two closed intervals. Note, however, 
that for the two extremal cases where ca(F) = 0 or 1, one of the intervals in the 
disjoint union is a singleton set (and the other interval is, respectively, either Xa 0 or 
AaJ. These extremal cases are particularly significant, and in the calculations of SJ7] 
onwards it is convenient to neglect the singleton set, thereby identifying the extremal 
.4-Sturmian interval with either A"a 0 or A A 1 - 
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Definition 13. We say that a ^-invariant Borel probability measure on is A- 
Sturmian if its support is contained in some M-Sturmian interval. Let 1 S 4 denote the 
collection of M-Sturmian measures. 

Remark 13. 

(a) I 11 view of Remarks 171 and ITTl the class of yf-Sturmian measures on V 4 is just the 
/i^-image of the class of Sturmian measures on the shift space 12, i.e. £4 = K t^- 
In particular (cf. Proposition [T|(b) ), 1 S 4 is also naturally parametrized by X = [0,1]: 
the map V o (h ^) _1 : £4 —> [0,1] is a homeomorphism, and for /j G £4 we refer to 
V o (h^) _ 1 (/i) = n(XAf) as its (Sturmian) parameter. 

(b) For all t > 0, a measure is M-Sturmian if and only if it is «4.(t)-Sturmian. 


6. The Sturmian transfer function 


In order to show that the maximizing measure for is supported in some A- 
Sturmian interval Y G I4, our strategy will be to add a coboundary <pr — (pr°T^, where 
the corresponding Sturmian transfer function cpr is introduced below, so that the new 
function /a + (fr — ipr 0 T 4 takes a constant value on all of T, and is strictly smaller 
than this constant value on the complement of T. This approach is patterned on ideas 
of Bousch [1] in the setting of the angle-doubling map and degree-one trigonometric 
polynomials. 

To proceed, it is convenient to introduce the following: 


Definition 14. For A G £, to each M-Sturmian interval I we associate the hybrid 
contraction r r : X —> X^, defined by 


rrO) 


T Al (x) if x G [0,c(r)) 
T Ao (x) if x G [c(r), 1]. 


(54) 


Remark 14. The hybrid contraction rr satisfies Tr(X) = T, and is piecewise Lipschitz 
continuous. More precisely, its restriction to [0, 04 (F)) is Lipschitz, as is its restriction 
to [c. 4 (r),i]. 


Lemma 18. Given A G and an A-Sturmian interval Y G Z 4 , there exists a unique 
Lipschitz continuous function : X —> 1R which simultaneously satisfied 

OO 

p'a,t = £o* ° rf) 1 Lebesgue a.e., (55) 

n= 1 

and 

^,r(0) = 0. (56) 

Proof. The function is Lipschitz, and rr is piecewise Lipschitz (cf. Remark [HI) . so 
each Tp is piecewise Lipschitz, so by Rademacher’s Theorem is differentiable Lebesgue 
almost everywhere, with L°° derivative. Now ||(rr ) , || 00 = 0(6 n ) as n 00 for some 
6 G (0,1), so the sum 

OO OO 

£(U°v”)' = £/i°v”W)' 

n =1 n=l 


6 The substantial condition is (j55|) . which determines <pa,t up to an additive constant. The extra 
condition (l5f)l) is useful in that it removes any ambiguity when discussing PA.r- 
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is Lebesgue almost everywhere convergent (as its nth term is 0(9 n )), and defines an 
L°° function with respect to Lebesgue measure on X. In particular, it has a Lips- 
cliitz antiderivative (pr, which is the unique Lipschitz antiderivative up to an additive 
constant, hence uniquely defined if it satisfies the additional condition <p^ t r( 0 ) = 0 . □ 

Notation 5. For A G £, Y G Ta, the function (p p = TA,v whose existence and unique¬ 
ness is guaranteed by Lemma [18] will be referred to as the corresponding Sturmian 
transfer function. 

Remark 15. Note that although the induced function /,4 is only defined on Xa, the 
Sturmian transfer function ipr is actually defined on all of X = [0,1]. For the most part, 
however, we shall only be interested in the restriction of yjp to Xa- More precisely, we 
shall be interested in certain properties of fA + Tv, or °f /a + Tv — Tv 0 Ta, considered 
as functions defined on Xa, beginning with the following Corollary [HI 

Corollary 9. If A G €, and Y is any A-Sturmian interval, then both fA + Tv and 
fA + Tv ~ Tv 0 Ta are Lipschitz continuous functions on Xa- 

Proof. Both T a and fA are Lipschitz continuous on Xa, as noted in Remarks EH and [SJ, 
and tpr is Lipschitz continuous on X as noted in Lemma [TH] hence Lipschitz continuous 
on Xa- It follows that both fA + Tv and /m + Pr — Pr 0 Ta are Lipschitz continuous 
on Xa- □ 

Lemma 19. Suppose A G £, t > 0, and Y is any A-Sturmian interval. The Lipschitz 
continuous function $A{t) + Tv — Tv 0 Ta '■ Xa —> M has the property that its restriction 
torn Xa 0 is a constant function, and its restriction to V D X^ 1 is a constant function. 

Proof. By Corollary [9] the function fA(t) + p>r — Tr 0 Ta is Lipschitz continuous on Xa, 
because Aft) G £. So by the fundamental theorem of calculus for Lipschitz functions 
(see e.g. [121 Thm. 7.1.15]), the required result will follow if it can be shown that 

(fA{t) +(pr ~ <Pr° T a )' = 0 Lebesgue a.e. on Y . (57) 

But f' A ^ = f' A , so ([57]) is equivalent to proving that 

(/ + ip r — ip T o Ta) 1 = 0 Lebesgue a.e. on Y . (58) 

To establish this almost everywhere equality, note that 

OO OO OO 

fA + f'r = S'a + ° if)' = XX/U ° Tf)' = Y. f'A 0 T ! Ml?)' 

71— 1 71—0 71—0 

and 

OO OO 

(V>r ° Ta)' = J2fA °t?o Ta.(t?)' o T a T a = ^/> 

71—1 71—1 

since (r r o Ta)' = Tp -1 , so indeed (I55|) holds. □ 

Remark 16. In the generality of Lemma fl9l the constant values assumed by fA{t) + 
Tv ~ Tv 0 Ta on Y D Xa 0 and Y D X^, do not coincide. However, we shall shortly give 
(see Lemma [22]) an extra condition which does ensure that fA(t) + Tv ~ Tv 0 Ta takes 
the same constant value on the whole of T. Indeed this possibility is a key tool in our 
strategy. 
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7. The extremal Sturmian intervals 


7.1. Formulae involving extremal intervals. As noted in Remark [I^l an M-Sturmian 
interval is the disjoint union of two closed intervals when viewed as a subset of X = 
[0,1]. However, the two extremal cases yield a leftmost A-Sturmian interval equal 
to Xaq U (27^(0)}, and a rightmost A-Sturmian interval equal to (Ta 0 (1)} U Xa, ■ 
The presence of singleton sets in these expressions is notationally inconvenient, and 
unnecessary for our purposes, so henceforth we neglect them. 

More precisely, henceforth the leftmost M-Sturmian interval is taken to be Xa 0 = 
Ta 0 (X ), and denoted by r 0 , so that r ro = T Ao ] the rightmost M-Sturmian interval is 
taken to be Xa 1 = T Al (X), and denoted by Id, so that r ri = TR. 

When the M-Sturmian interval Y is either T 0 or I'd, there is an explicit formula for 
the Sturmian transfer function ip?: 


Lemma 20. Suppose A G £. For i G {0,1}, and all x G X, 


y Vi (x) = log 


f x + g Ai \ 

V QAi ) 


(59) 


Proof. Now rr, 


Ta^ so the dehning formula (|55l) becomes 


FVi( x ) = J2(fA°T2J\x) , 

77.—1 


(60) 


and then (j50li implies that 


p'vi i x ) 


1 

X + 0 Ai 


Noting that the sign of x + Qa , is positive when i 
Corollary [8]), as well as the convention that <^(0) 
the required expression (l59]h 


0 and negative when i = 1 (see 
0 (see Definition we deduce 

□ 


Definition 15. Given A = (A 0 , A±) G (£ and Y G X 4 , dehne A_ 4 (r) G 1 by 

Aa(F) = (^r(l) - ^r(0)) - (<^r(TA 0 (l)) - ^(Ta^O))) , (61) 

noting the equivalent expression 

A»(r) = Vr(l) - (Vr(A„(l)) - Vr(T Al (0))) (62) 

as a consequence of the convention that (^r(O) = 0 (see Lemma ITSTl . 


The values A^R) play an important role, so it will be useful to record the following 
explicit formulae: 


Lemma 21. Suppose A G For i G {0,1}, 


AaGA) 


tog ( (1 + g * ) (Ar + ^.) j . 

V e* (srfc; + a».) ) 


(63) 


Proof. This is immediate from the dehning formula (16T| (or (l62|l i for A^(R), together 
with formula (j59l) for ( pr t , and the fact that 7 a 0 (1) = and T^^O) = b b f dl • □ 

Corollary 10. If A G £ then 

a^QA) < 0 < A^(r 0 ). 
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Proof. The four terms g Ail 1 + Q Ai , ^f^ + QAi, £ 7^7 +Ah in © are all positive if f = 0, 
and all negative if i = 1, by the inequalities (lS2]i in Corollary [ 8 j Now A G £ implies 
that ([6]) holds, so ^ 7^7 + g Ai > 4 . g A ^ and clearly 1 + g Ai > g Ai Consequently 

(1 + QaJ + QA i) 

Qa z + QM) 

is strictly greater than 1 if i = 0, and strictly smaller than 1 if i — 1. The result then 
follows from Lemma 1211 □ 

7.2. Adaptations for other matrix pairs. As mentioned in §1.31 the methods of 
this paper can be adapted so as to give alternative proofs of certain results (analogues 
of Theorem [3]) mentioned in §TJ namely establishing that a full Sturmian family is 
generated by the matrix pair ()5j). and for matrix pairs corresponding to sub-cases 
of ([3]) and (HD which lie in the boundary of 2>Q In this subsection we indicate the 
modifications necessary to handle these cases. 

Firstly, the induced space X A may be the whole of A" = [0,1] rather than a disjoint 
union of two closed intervals: this occurs if a 0 /c 0 = b\/d\ (i.e. when (J 6 J) becomes an 
equality), which is the case for the pair (jSJ), and for (HD if be = 1 . 

Secondly, in each of the cases ([2D, (J5D and (HD, the induced maps T Ao and T Al have 
fixed points at 0 and 1 respectively, so that the dynamical system T 4 also fixes these 
points. For ([2D, both 0 and 1 are indifferent fixed points, i.e. T' Aq (0) = 1 = T' Ai ( 1). 
For ([2D and (HD these fixed points are unstable for the induced maps T Ao and T Al , 
i.e. Tjj o (0) > 1 and T' Ai (1) > 1, but both of these maps also have stable fixed points 
in the interior of X = [0,1]. Consequently for Q the dynamical system T A : X —> X 
has indifferent fixed points at 0 and 1 , and no other fixed points, while for (J2D and (HD 
the dynamical system T 4 has stable fixed points at 0 and 1 , and two further unstable 
fixed points in the interior of A". 

The potentially problematic stable fixed points for T 4 can in fact be avoided by 
omitting to consider the two extremal A-Sturmian intervals: this ensures the asymp¬ 
totic || (TpD'lloo = 0{6 n ) as n -A- 00 , 6 e (0,1), and the existence of Sturmian transfer 
functions is proved as in Lemma fl 8 l In the case where T A has indifferent fixed points, 
it is even possible to consider extremal A-Sturmian intervals, as the series defining the 
Sturmian transfer function is nonetheless convergent. The existence of Sturmian trans¬ 
fer functions then allows the remainder of the method of proof to proceed essentially 
as for matrix pairs in 3D, ultimately establishing analogues of the main result Theorem 

El 


8. Associating A-Sturmian intervals to parameter values 

Notation 6. For a Sturmian interval r g x Al let Sr G denote the A-Sturmian 
measure supported by T, i.e. Sr is the unique T^-invariant probability measure whose 
support is contained in T. 

Lemma 22. Suppose Ag t If t G M + and T G X A are such that 

fm(T Aa (l)) - fAt,)(T At m = A A (rj , ( 64 ) 

7 Note that all of the matrix pairs in EL ©, © have the property that A 0 is projectively concave 
and A 1 is projectively convex. 
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then the Lipschitz continuous function f A ^ + p T — tp r o T A is equal to the constant 
value f f^ t ) dsr when restricted to T. 

Proof. By Lemma [T9] we know that f A ( t ) + Pr ~ Pr 0 T A is constant when restricted to 
T fl X Ao , and also constant when restricted to T D X^ 1 . To prove that these constant 
values are the same, it suffices to show that f A ( t ) + pr — (fir 0 T A takes the same value 
at the point Ta 0 (1) G X Ao as it does at the point T Al (0) G X Al - But the equality 

(/-4(t) + Pr ~ Pr 0 T a ) (T Ao ( 1)) = (f A (t) + pr - Pr 0 T A ) (T Al { 0)) 

holds if and only if 

fA(t)(T Ao (l)) - f A (t)(T Al ( 0)) = ^r(l) - V ? r(0) - (^r(T4 0 (l)) - ^(^4,(0))) , 

in other words ./^p) (T Ao (1)) — fA(t)(T Al ( 0)) = A^r), which is precisely the hypothesis 
(1641) . □ 

Corollary 11. Given Tg £, if t G M + and T G X A are such that 

k *(ferf) t ' I ) = A *< r >' (65) 

then the Lipschitz continuous function f A ^ + pr — Pr 0 T A is equal to the constant 
value f f A (t ) dsr on T. 

Proof. By Lemma [521 it suffices to show that 

f Am (T Aa (i)) - o)) = log ((yipf) r ‘) ■ 

and by Lemma flSlfiiil this is equivalent to showing that 

UT M ( 1 )) - Sa{TaM) = log (jfzp) ■ 

Substituting T Ao (l) = ao a f CQ and T Al (0 ) = bi b f di into, respectively, the formulae (j40l) 
for f A 011 X Ao and X Al yields 

f A (T Ao (l)) = log(a 0 + c 0 ) (66) 

and 

/^(T j4l (0))=log(6 1 + d 1 ), (67) 

so the result follows. □ 

In view of equation (jSS]) we make the following definition: 

Definition 16. For A G € and i G {0,1}, define U = U(A) by 

(68) 

so that 

k >s((^)v 1 )=A.(r.). 

Remark 17. Since e _A - A To) < 1 < e - A ^( p i) by ([HID , it follows that 

to(A) < ti(A). 
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Lemma 23. For A G £ and i G {0,1}, 

QAi (oo + QAiia o + Co)) 


U(A) = 


(1 + QAi) (b\ + QAiip 1 + di)) 
Proof. From (FTTI) we see that for i G {0,1}, 


(69) 


3 -A^(Ti) _ 


Pa i [ + QAi 


(1 + Q Ai ) (^dT + &**) 


so that (1681) gives 


f (A) = ( a ° + C ° ^ — A^( r i) _ QAj («o + QAi (do + Cp)) 

\bi + di) (1 + QAi ) (&i + QAi(bi + c/i 


)) 


which is the required expression 

A consequence is the following property: 
Corollary 12. For A G <£, t G K + , and i G {0,1}, 

ti(A) 


(70) 

□ 


ti(^l(t)) = 


t 


(71) 


Proof. This follows easily from (169]) , and the easily verihed fact (used only in the proof 
of the i — 1 case) that QtA x = Qa x - Specihcally, for i G {0,1}, 

QAi (c-o + QAi{a>o + c o)) 1 QAi {ao + QAi(ao + cq)) U(A) 


ti(A(t )) = 


(1 + QAi) (tbi + QAi (tb\ + tdi)) t (1 + QAi) {bi + QAi(b\ + di)) t 


□ 


Lemma 24. For A G £, the quantities to(A) and ti(A) admit the following alternative 
expressions: 

det A 0 


to{A ) — 


and 


ti(A) = 

Proof. Since (170]) implies 


(a 0 - b 0 ( 1 + f? A *)) (di + 6i(l + ^Ao)) 
(°o + c 0 (l + QaI) X ) (°i — ^i(l + £hq)) 


to(A) — 


det Ai 

a-o + QA 0 (a o + cq) 


(1 + Qa 0 ) (di + bi(l + QaI)) 
we see that to(A) is equal to (172]) if and only if 

a>o + QAo (®o + c 0 ) _ aodo — &o c o 

1 + Qa 0 a 0 — b 0 (l + QaI) 

Clearing fractions in (175]) reveals it to be equivalent to the equation 

Qa 0 (qa 0 ) = cha 0 Qa 0 + PaoQAo ~~ bo = 0 , 

which is true by Lemma HI 
Since (1761) implies 

QAi ( a o + c 0 (l + Qa\) *) 


(72) 

(73) 

(74) 

(75) 


ti(A) = 


b\ + QAi (b± + d\) 


(76) 
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we see that £i(4.) is equal to (173]) if and only if 

QAi _ °i ~ bi(l + g A ^) 

b\ + g Al (61 + d\) det A\ 

Clearing fractions in (1771) reveals it to be equivalent to the equation 

Qa 1 {qa 1 ) — c^Ai^i + @a 1 Qa 1 — bi = 0 , 
which is true by Lemma [4] 


( 77 ) 


□ 


Notation 7. For A G £, let T A denote the open interval (£ 0 (4.), £1 (4.)). 


Proposition 6. Let A G £. For each t G T A there exists an A-Sturmian interval 
V A (t) G X A such that f A ^ + ipY t ~ Fv t 0 T A is equal to the constant value f f A {o dsr A (t) 
on r A (t). 


Proof. First we show that A^ : T 1 —> A^(T) is continuous. The formula 
^A(r) = ^r(l) - Pr(T Ao (l)) + (p r (T j4l (0)) 

o 0 \ , ( h 


defines 


— <^r(l) — Fr 


o-o + Cq 


+ Ft 


h\ + d[ 


so the continuity of A^ will follow from the fact that T 1 —> (pr{z) is continuous for each 
z G X. To see this, first note that Definition 1181 gives 


<Pr(z) = Pr(z) - <Pr(0) = 
and re-writing this integral as 


pz pZ 00 

Fr— 0 r f')') 

Jo Jo n=1 


00 „ z 00 « 00 « « 00 

E Oa ° r ^) , = E / f'A = E / U r -|o,,|P = / f ' A E UfM 

n=l ^ n=l n=l j J n= 1 


gives 

VrM = | q/yr), (78) 

where 

OO 

^~( r ) = 1 t t[M ' 

n=l 

Now each map 77~ )n : T i—>■ l r n[ Q)Z ] clearly belongs to C([r 0 , ri], L 1 ), the space of 
continuous functions from [r 0 , Ti] to L 1 = L 1 (dx), and Y^=i-^z,n is convergent in 
C([r 0 , ri], L 1 ), so H z (-) G C([r 0 , Ti], L 1 ). It then follows from (1781) that T H > is 

continuous, as required. 

Now note that the function G A defined by 


G A (t) 


log 


f a 0 + Co\ ! 
\bi + d\) 


is strictly decreasing, since a 0 ,c 0 ,bi,di > 0, so if £ G 1~ A = (£ 0 (4), £i(4)) then 


(79) 


G A (t) g {G A (tM))iG A (t,(A))) = (A^ro, A^(r 0 )). 


(80) 
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Now is continuous, so applying the intermediate value theorem to this function 
(defined on the interval [r 0 , Id]) we see that in view of (I80[) . there exists an T-Sturmian 
interval, which we denote by T A (t), such that r_ 4 (f) G (r 0 ,r!) and 

X A (T A (t)) = G A (t). (81) 


In other words, 

Ios ((uTt) r ‘) =a - 4 (D(« ) ), 

so that Corollary QT] implies that f A ( t ) + <Pr t ~ Pr t ° T A = f f A ( t ) ds TA ( t ) on V A (t), as 
required. □ 


9. The case when one matrix dominates 


It will be useful to record the value of the induced function f A at the two fixed points 
of T a : 


Lemma 25. For A G £ and i G {0,1}, 

det Aj 


f A {p Ai ) = log 


a t - bi( 1 + qA) 


= log 


det Aj 


b i ~ UPAi+lAi) 


Proof. Straightforward computation using (fT 8 j) . (1201) . and (HOjl . 


□ 


We hrst consider a sufficient condition for the projectively concave matrix To to be 
the dominant matrix of the pair A = (T 0 , Af)\ 


Theorem 5. If A G € is such that 


to{A) > 1, 


(82) 


then the Dirac measure at the fixed point p Ao is the unique f A -maximizing measure; in 
particular, the joint spectral radius of A is equal to the spectral radius of A 0 . 

Proof. Choosing (p(x) = <pr 0 ( x ) = log ( ~~~ j ensures, by Lemma ITU1 that f A + <p — 

ip oT 4 is constant when restricted to X Ao — ho, and the constant value assumed by this 
function is clearly f A (p A 0 )- The result will follow if we can show that f A + if — p °T A 
is strictly decreasing on X Al , and that the value (f A + cp — p o T 4)( 6 ) at the left 
endpoint of X Al is no greater than the constant value f A (pA 0 )- This is because the Dirac 
measure 5 PAq will then clearly be the unique maximizing measure for f A + (p — p o T A , 
and hence the unique maximizing measure for f A . 

To compute the value (f A + ip — p o T A ){ , h f, ) we recall from (1671) that 


f A 


bi 


h\ + d] 


and note that 


and 


7 


bi 


b\ + dj 


= log 


= f A {T Al ( 0)) = log( 6 i + di ), 

= 7 ( 0 ) = 0 , 

(di + bi(l + p A g)) 



= log 


b\ + d\ 
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Therefore 


By Lemma 


{fA + <P ~ P ° t a) 


bi 


b\ + di 


— l°g (di + b±(l + £?Ao)) 


fAiPAo) = log 


det An 


ao - 6o(l + QaI ) J 

so (IMP and (jMj) imply that the desired inequality 

lh 


(83) 

(84) 


(/j + *>-»>» Ta) 

is precisely the hypothesis (l82lh since 

to{A) = 


b\ + d\ 
det A 0 


< f A (PA 0 ) 


(a 0 - &o(l + Paq)) (di + ^i(l + £Ao)) 


by 

It remains to show that f A + <p — p ° T A is strictly decreasing on X Al . Suppose 
x G X Al . We know by (JUD that 

det A± 


Now 


so 


f A (x) = log 

< p(x ) = log 
(p{T A {x)) = log 


(AAx (X + (Tax) ) 

X + Qa 0 


and therefore 

Ua + P - ( P°Ta){x) =log 

It therefore suffices to show that 

x i—^ 


Qa 0 

Sax{x) + Qa 0 
Pa 0 

_det A\{x + q A q) _ 

-a Al (x + <7A 1 )(-SU 1 (a;) + Pa 0 ) 

x + qa 0 


-a Al (x + + Qao) 

is strictly decreasing. For this note that 


(85) 


0 , , , ( 6 i + di)x- 6 i (&i + di - a Al QA 0 )x + ( a i “ bi)g Ao “ 6 i 

*Jyli \X) + QAq ~ -7-;- 7 + Qa q — - 


so 


-a Al (x + a Al ) 
is seen to be the Mobius function 


-a Al (x + a Al ) 


x (->• 


x + Qa 0 


(h + d 1 - a Al QA 0 )x + K - b^pAo - h ’ 
which is known to be strictly decreasing by Lemma [bH 

As a consequence of Theorem [5] we obtain: 


□ 


Corollary 13. If A G £ and t G M + are such that 

t<t 0 (A), (86) 

then the Dirac measure at the fixed point pa 0 is the unique f A (t) -maximizing measure; 
in particular, the joint spectral radius of Aft) is equal to the spectral radius of A 0 . 
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Proof. The assumption (l 86 |) means, using dTTj) . that t 0 (A(t)) > 1, so the result follows 
by applying Theorem 0 with A replaced by A{t). □ 

We now turn to an analogous sufficient condition for the projectively convex matrix 
Ai to be dominant: 

Theorem 6. If A G €. is such that 

ti(A)<l, (87) 

then the Dirac measure at the fixed, point p Al is the unique f^-maximizing measure; in 
particular, the joint spectral radius of A is equal to the spectral radius of Ai. 

Proof. Choosing pfx) = p ^((c) = log ( ~~~ ) ensures, by Lemma fl9l that f A + p — 

poT A is constant when restricted to X Al — Ti, and the constant value assumed by this 
function is clearly f A (p Al )- The result will follow if we can show that f' A + p — p oT A 
is strictly increasing on X Ao , and that the value (f A + p — p o T A ) ( a "'f co ) at the right 
endpoint of X Ao is no greater than the constant value f A (p A J- This is because the Dirac 
measure S PAi will then clearly be the unique maximizing measure for f A + p~p° T 4 , 
and hence the unique maximizing measure for f A . 

To compute the value (f A + p — p o T A )( a f ) we recall from (1661) that 


/a 


a 0 


ao + Cq 


= f A (T Ao ( 1)) = log(a 0 + c 0 ), 


and note that 




a 0 


ao + Cq 


and 


<P 


ao 


ao + Cq 


= log 


= ^(1) = log ( 1 + 0Al ) = log(l + g A l ), 

V QAi J 

( c o + «o(l + 0 A i)) 


ap 

ap+cp 


+ QAi 


Qa 1 


= log 


ao + Co 


Therefore 


By Lemma 


{fA + P ~ P 0 T a ) ( — \ = log (a 0 + c 0 (l + q a [) x ) 
\a 0 + C 0 / 


IaIpm) = log 


det A-i 


so 


and 


a 1 — ^i(l + 6 Al ) 

imply that the desired inequality 

Oq 


( 88 ) 


(89) 


(fA + f-V 0 T A) 


ao + Cq 


< Ia(p Ai ) 


is precisely the hypothesis (jHTjl . since 

(ao + c 0 (l + Q Al ) x ) (ai — 61 (1 + & A l)) 


ti(A) = 


det Ai 


by 

It remains to show that f A + p — p o T A is strictly increasing on X Ao . Suppose 
x G X Ao . We know by (fTO|) that 

det A 0 


f A (x) = log 


-ol A 0 (x + cr Ao ) 
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Now 


so 


if(x) = log 
<p( t a(z)) = log 

and therefore 

(fA + P ~ P°T A )(x) =log 
It therefore suffices to show that 

X n - 


X + q Ai 
QAi 

Sa 0 {x) + qa i 
Qa 1 

_det A 0 (x + g Al ) _ 

-ola 0 (x + a Ao )(S Ao (x) + g A J 

X + Q Al 


ola 0 {x + cr j 4 0 )(S' J 4 0 (a;) + g Al ) 
is strictly increasing. For this note that 


(90) 


c ( \ . (bo + do) x — b 0 (bo + d 0 — aAoQA^x + (ao — bo)gA 1 — bo 

^A 0 [x) + Qa i — —7—;—i—v + Qa-l — 


so 


■ ol Aq (x + CE 4 0 ) 
is seen to be the Mobius function 


~ol Ao (x + <Ja 0 ) 


x (->• 


X+ Q Al 


(bo + do — olaoQa^x + (ao — bo)QA 1 — bo 
which is known to be strictly increasing by Lemma [TJ1 

As a consequence of Theorem [6] we obtain: 


□ 


Corollary 14. If A & £ and t e M + are such that 

t > ti(A ), (91) 

then the Dirac measure at the fixed point p Al is the unique f A ^.) -maximizing measure; 
in particular, the joint spectral radius of Aft) is equal to the spectral radius oftA x . 

Proof. The assumption (l9Tf means, using (17X1) . that t\(A(t)) < 1, so the result follows 
by applying Theorem [6] with A replaced by Aft). □ 


10. STURMIAN MAXIMIZING MEASURES 

It is at this point that we make the extra hypothesis that the matrix pair A lies in 
the class D C £. By Lemma fl6f ii) we know that if A £ £ then f A is strictly increasing 
on X Ao and strictly decreasing on X Al ; the following result asserts that if we make the 
stronger hypothesis that A G £> then these monotonicity properties are inherited by 
all functions formed by adding a Sturmian transfer function <pr to f A . 

Proposition 7. Let A G T>. For each A-Sturmian interval T e X A , the function 
fA + <Pr- X A —* R is strictly increasing on X Ao , and strictly decreasing on X A , ■ 

Proof. First suppose x G X Ao . Let 0 = i 0 < ii < < ... be the sequence of all integers 

such that rf k (x) G X Ao . 

For k > 0, writing z = rf k (x) we see that if 1 < i < 4+i ~ 4 then rf(z) G X Al , and 
thus Tp(z) = T l Ai (z ), so that 

ik-\. i —ik — 1 ^fc+l— 'i'k — 1 oo 

V (nor‘)'W=/y Z )+ V (f A oT' A J(z)>f A (z)+fj(f A °T' A J(z), (92) 

i= 0 j=l i=l 
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where the inequality is because (/^ o T\ y(z) < 0 for all i > 1, by Lemma H6l Now 
z G Xa 0 , so (145|) in Lemma fl5l (iii) gives f^z) = —(z + cu 4 0 ) _1 (which is positive), 
and formula (l50j) from Corollary [7] gives X^i(/a ° r ^A 1 )'{ z ) — ( z + QAx)~ 1 (which is 
negative), so (192|) implies that 


^fc+1 ik 1 

X Ua ° rf )'{z) > 


i=0 


-1 1 
-1-• 

z + a A 0 z + g Al 


(93) 


However A G D, so qa 1 < cta 0 , and therefore the righthand side of (j93|) is positive, so 
we have shown that 

^fc+1 ^ k 1 

X] (/a ° T r )\ z ) > 0 • 

2=0 

It follows that for all k > 0, 


*fc+l —1 ik+l—i-k —1 

X (/a ° r r) , ( x ) = ( r r0 , ( a; ) X ° T x)'( z ) > 0 > 

n—ik i —0 

and hence 

OO OO 1 

(/a + Pr)'(z) = X^- 4 ° r r) / ( a; ) = XI X r r)'( x ) > 0 > 

n=0 k= 0 

so /a + <£r is strictly increasing on X^ 0 . 

Now suppose x G X^. The proof proceeds analogously to the above. Let 0 = jo < 
j i < j 2 < ... be the sequence of all integers such that r^ k (x) G X^. 

For k > 0, writing z = r^ k (x) we see that if 1 < i < jbc+i — jk then r^(z) G X^ 0 , and 
thus Tp(z) = T\ o (z ), so that 

J'fc + l —jk — 1 Jfc+1 —jk — 1 OO 

E (/W)'w = /»+ E (u»u,)'w</J(*)+E(u°uj(*-). (94) 

2=0 2=1 2=1 

using the fact that (/^ °n 0 )'W > 0 for all i > 1, by Lemma [161 The righthand side of 
(l94|) can be written as —(z + oyq) -1 + (z + £M 0 ) -1 using Lemma [T5l (iii) and Corollary 
[3 and this is strictly negative since oa 1 < Qa 0 because A G D, so we have shown that 


jk+i-jk-i 

X ° r r) / (^) < 

2=0 


-1 


H-< 0 

z + a Al z + g Ao 


It follows that for all k > 0, 


and hence 


—1 

Z(fAOT?)'( X ) 

n =jk 


jk+l-jk-1 

(t^)'{x) X ( fA ° x r)'( z ) < 0 , 
2=0 


(/a + <PrY(x) = YifA ° t t)'( x ) 

n=0 


OO jk+l~ 1 

X X(^° T r)'( x ) < 0 > 

k =0 n=j fe 


so /a + <£r is strictly decreasing on X^ x . 


□ 
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Theorem 7. Let A G D and t G T A = (£o(*4)j ti(-A)) ■ The A-Sturmian measure 
supported by the A-Sturmian interval r^(t) is the unique maximizing measure for f A (t); 
thus the corresponding Sturmian measure onfl = {0,1} N is the unique Aft)-maximizing 
measure. 

Proof. Let us write (p = <Pr A (t) and T = T A — T A ^■ We know that f A ( t ) + p> — p>°T is & 
constant function when restricted to T A (t) = fit At] i by Proposition [6j In particular, 

(/U(t) + - (p o T) (7 t “) = (f A{t ) + <p - <p o T) (7 t + ) , 

and because Tf^f) = T( y t + ), we deduce that 

+ <p) {it ) = Um + t) [it ) • (95) 

But Proposition [7] implies that f A (t) + p> is strictly increasing on X Ao , and strictly 
decreasing on X Al , so together with (j95j) we deduce that 

(fA(t) + <p)(x) > (f A (t) + <p)(y) for all x G V A (t), y G X A \ V A (t). (96) 

Consequently, if z,z' are such that T(z) = T(z'), with z G T A ft) and z' ^ r^(t), 
then 

(/U(t) + V){z) > ( fA(t) + ¥»)(«'), 

and hence 

(/U(t) + ip - tp o T)(z) > ff A (t) + (p~<p o T)(z'). 

In other words, the constant value of f A t t \ + <p — p oT on T^(t) is its global maximum, 
and this value is not attained at any point in X A \r A (t). 

It follows that the Sturmian measure supported by r^(t) is the unique maximizing 
measure for f A ^) + ip — (poT, and hence the unique maximizing measure for f A (t). Thus 
the corresponding Sturmian measure on 17 = {0,1} N is the unique A(t )-maximizing 
measure. □ 

Recall from IQ] that (£ C M 2 (M) 2 denotes the set of matrix pairs which are equivalent 
to some pair in D, where equivalence of A — (A 0 ,Ai) and A! = (A[, A\) means that 
A' 0 = uP~ 1 AqP and A\ = uP _1 R 1 P for some invertible P and u,v > 0. We deduce 
the following theorem: 

Theorem 8. If A G (E and t G M + ; then Aft) has a unique maximizing measure, and 
this maximizing measure is Sturmian. 

Proof. It suffices to prove the result for A G T), and this is immediate from Corollaries 
mi and [TO Theorem [7] and the fact that D C £. □ 

11. The parameter map is a devil’s staircase 

As noted in Remark [7] if A G £ then there is a topological conjugacy h A : 17 —» Y A 
between the the shift map cr : 17 —> 17 and the restriction of T A to the Cantor set 
Y a C X a ; the map h A is strictly increasing with respect to the orders on 17 and Y A 
(cf. Remark ITTT) . If d : 17 —» [0,1] is as in Proposition |T|(c), associating to ca G 17 the 
Sturmian parameter of the measure supported by [Oca, lea], then the map d A : Y A —> 
[0,1] given by d A = d o hjf enjoys the same properties as d: 

Lemma 26. The map d A : Y A —>■ [0,1] is continuous, non-decreasing, and surjective. 
The preimage dfffP) is a singleton if V is irrational, and a positive-length closed 
interval ifV is rational. 
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Proof. Immediate from Proposition [Tj(c), and the fact that Jia is strictly increasing. □ 

Note that d a associates to y G Y 4 the parameter of the M-Sturmian measure sup¬ 
ported by the M-Sturmian interval cj A 1 (y), where we recall from Definition d2] that the 
identification map ca : Ta —* [0,1] is defined by c^(r) = T^rninP) = ^(maxT). Of 
the extensions of the function d a from the Cantor set Y 4 to the interval X = [0,1], 
there is a unique one giving a non-decreasing self-map d^ : X —> [0,1]. This exten¬ 
sion, which we shall also denote by d^, is continuous, and d^{c) is just the parameter 
of the M-Sturmian measure s c -i( c ) (he. of the M-Sturmian measure supported by the 
M-Sturmian interval c~^(c)) for each c G X. We therefore have the following: 

Corollary 15. The map d^ : X —> [0,1] is continuous, non-decreasing, and surjective. 
The preimage df^ifP) is a singleton if V is irrational, and a positive-length closed 
interval ifV is rational. 

Definition IT. For A G £>, let 'Pyx(t) denote the parameter of the Sturmian maximizing 
measure for A(t), or equivalently of the M-Sturmian fA(ty maximizing measure. This 
defines the parameter map Va ■ IP + —> [ 0 , 1 ]. 

Recalling (see Proposition [ 6 ]) the map t ha r^(t) associating M-Sturmian interval to 
parameter t G Ta = (to(^), ti(-4)), we see that in fact the map Va • Ta —>■ X can be 
written as 

V A = d A °c A °TA- ( 97 ) 

This means that Va will enjoy the same properties as established for in Corollary 
IT5l provided c/oT^ is strictly increasing: 


Lemma 27. For A G the map ca°Ta : Ta —> X is strictly increasing and surjective. 


Proof. Recall from d79li the function Ga given by 

G A (t) = log 


^0 + <h) \ 1 


v b\ + d\ 

and that r^(t) G Ta is dehned (see (15T1P by the identity 

A a 0 = Ga ■ 


Now Ga is strictly decreasing, so in particular injective, therefore the map T^ is nec¬ 
essarily injective. Note that clearly extends to a continuous injection on Ta = 
[t 0 (.A),ti(M)], with r A (ti(A)) = Ti for i G {0, 1}_ 

Now c _4 :1 a —^ X is a bijection, so C 4 o : Ta —> X is injective, and its continuity 
means it is strictly monotone. But c^(r^(f 0 (-^))) = 0 and c^(r^(ti(^4))) = 1, so the 
map C 4 o r _4 must be strictly increasing and surjective, as required. □ 


We can now prove that the parameter map Va ■ 1P + —>> [0,1] is singular. More 
specifically, its properties described by the following Theorem [U] mean it is a devil’s 
staircase. These properties of the parameter map had been noted by Bousch & Mairesse 
[5] in the context of the family (l3|) . and proved in detail by Morris & Sidorov [T7] for the 
family (JSJ) • The following result can be viewed as a more detailed version of Theorem 
[3] from m 


Theorem 9. If A G (£ and t G M + , then A(t) has a unique maximizing measure, and 
this maximizing measure is Sturmian. Let ^(t) denote the parameter of the Sturmian 
maximizing measure for A{t). The parameter map Va ■ IP + —> [0,1] is continuous, 
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non-decreasing, and surjective. The preimage V^ifP) is a singleton ifT is irrational, 
and a positive-length closed interval ifV is rational. 

Proof. The set l£ consists of matrix pairs which are equivalent to a matrix pair in £), 
so it suffices to prove the result for A G D. Theorem [H] gives that A{t) has a unique 
maximizing measure, and that this maximizing measure is Sturmian. 

For t G R + \ Ta we know that 

TU(t) = 0 for t G (0,f 0 (*4.)) (98) 

by Theorem [T31 and 

T'U(t) = 1 for t G (ti(Al), 00 ) (99) 

by Theorem fl4l since the Dirac measures at the fixed points pa 0 and pa 1 are .A-Sturmian 
measures of parameters 0 and 1 respectively. 

In view of (195|) and , it suffices to establish the required properties of Va on the 
sub-interval Ta = (to(-4); U(A)). Using the factorisation (IfJTD . we see that this follows 
from Corollary [To] and Lemma [23 □ 
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